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Abstract. 

A model independent method to search for particles of unknown masses in events with missing energy is presented. The 
only assumption is the topology of the decay chain. The method is tested in events with top pairs decaying leptonically with the 
presence of two neutrinos in the final state. Possible applications are searches for a heavy resonance decaying to top pairs as 
well as next generation heavy quarks. Similar topologies are predicted by supersymmetric models with R-parity conservation 
resulting in final states with two invisible neutralinos. 
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INTRODUCTION 

Could we find the mass of the top quark and W boson in 
LHC from top-pairs decaying leptonically in the hypo- 
thetical case in which both mt and mw were unknown? 
Could we establish a discovery by observing the mass 
peaks above background for both particles without any 
assumption about the underlying theory? 

Standard Model has predicted the mass of the W bo- 
son which was later measured accurately by LEP. Top 
mass has also been measured in both Tevatron and LHC. 
The reason that such a question is interesting is the sim- 
ilarity with BSM theory models which predict new in- 
visible particles (e.g neutralinos). It is believed by many 
physicists that such theories might be the next step in 
our understanding of the physical world as they provide 
(among others) an explanation for the nature of dark mat- 
ter. For example, supersymmetric models with R-Parity 
conservation predict event topologies with at least two 
neutralinos. The existance of two invisible particles in 
the decay chain makes the extraction of the unpredicted 
by the model masses a difficult problem [1], [2], [3]. Sim- 
ilar topologies might exist in the decays of a hypothetical 
heavier quark from a fourth generation or a heavy reso- 
nance decaying to top pairs. 

Observation of an excess of events with large missing 
energy would provide evidence for the existance of new 
physics but could not help further in our understanding of 
what the new physics is. Several MET-like observables 
have been proposed for this purpose (Met, Ht, Mgff 
etc). A conmion characteristic of all of them is that new 
physics would appear as an excess of events in the tail 
of a distribution. The establishment of a discovery would 
be a difficult problem due to the small difference in the 
shape between signal and background. In this case, a 
very good understanding of the tail would be necessary, 
a challenging task at least at an early stage. But let's say 



that LHC experiments observe an excess of events in the 
tail of a MET-like observable and have confidence in the 
result. What can we say about new physics except that it 
exist? 

Understanding the physics is much easier in possible 
BSM signals which do not predict any invisible particles. 
In this case, the invariant masses can be reconstructed up 
to combinatoric ambiguities. Knowledge of the masses 
provides full understanding of the event kinematics and 
subsequently allows boosts to the correct rest frames 
of the decaying particles. Angular distributions in these 
rest frames provide the best observables to discriminate 
between different spin hypothesis. Knowledge of both 
mass and spin of the new particles is a very important 
step towards understanding what the new physics is. 

In addition, signal events are contained in a small re- 
gion of mass space in contrast to the higher in dimensions 
kinematic phase space in which selection cuts are of- 
ten applied. For example, in the supersymmetric pseudo- 
scalar MSSM Higgs decay A ^ Zh,Z ^ 1+1" ,h -> bb 
the reconstruction of both A,h hypothetical Higgs bosons 
is possible. The new physics events would then be ob- 
served as a peak in the 2-dimensional mA, mh mass plane 
[4]. In this case, there is significant difference in shape 
between signal and background. The establishment of a 
discovery is an easier task as a data-driven estimation of 
the background can be performed from the sidebands. 

Searching for resonances in mass-space is model inde- 
pendent as the signal extraction does not need any change 
that depends on the hypothetical model. The selection 
applied can be simple energy thresholds to ensure cor- 
rect reconstruction as well as identification of the final 
state objects. 

It is often believed that reconstruction of particles 
masses in topologies with two invisible particles is not 
feasible. In the rest of this study a counter example 
is presented with the simultaneous reconstruction of W 
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FIGURE 1. (Left): Solvability for the correct solution of a single top-pair event in the 2-dimensional mass plane. (Right): 
Solvability weighted with the PDFs and normalized to unit volume for the same solution and event. 




°0 50 100 150 200 250 300 °0 50 100 150 200 250 300 

M,op (C3eV) M,„p (GeV) 



FIGURE 2. (Left): Solvability (for the correct solution) of 300 test events produced from the same single event by smearing of the 
momenta of leptons, jets and transverse missing energy. (Right): Average weighted solvability for the same test events normalized 
to unit volume. 



boson and top quark in the dilepton top-pair final state in 
a p-p collider like LHC. The method can also be used to 
search in a model independent way for a new generation 
t' and any resonance decaying to top pairs pp ^ X' ^ tt. 

EVENT SIMULATION AND SELECTION 

All Monte Carlo sample used in this study were gener- 
ated using Pythia 6 [5] except the production of a next 
generation quark via pp ff , qq t'P for which Pythia 
8 was used [6]. Top pair events with mt = 172.5 GeV 
and mw = 80.4 GeV were generated corresponding to an 
integrated luminosity of 1 fb~^. For the same intergrated 
luminosity samples of Drell-Yan (Z ^ e+e~, /i+/i~) and 
diboson backgrounds (WW, WZ, ZZ) were also created. 

The performance of the method to new resonances de- 
caying to top pairs was tested using a sample of 300 
pp ^ Z' ^ tt events, with the new gauge boson gener- 
ated with mz = 500 GeV and Tz = 15 GeV. Finally, for 



the search for a new generation quark, a sample of 300 
pp ff , qq t'F events was produced with mt = 250 
GeV. Pythia cross-sections were used in all cases except 
top pairs for which a more accurate calculation was used 

m. 



Jet reconstruction 

All stable particles except neutrinos were used as an 
input to Fastjet-2.4.2 in order to cluster particles into 
generated jets [8]. Jet reconstruction was performed with 
anti — Kt algorithm using a cone of 0.5. The generated 
jets were then smeared according to the jet reconstruction 
performance of a typical LHC detector [11]. The ratio of 
the reconstructed to the partonic jet energy was used to 
correct the jet energy scale. This was performed in 3 Pt 
bins (Ft < 50 GeV, 50 GeV< Ft <150 GeV, Ft > 150 
GeV) and 2 pseudorapidity bins (|r]| < 1.5, 1.5 < |t7| < 
2.5). The correction factors were calculated from the top- 




FIGURE 3. Average PDF weighted Solvability for the 300 
test events produced from the same initial top pair event. The 
distribution is normalized to unit volume. 

pair sample using the bjets matched with a bquark. All 
jets matched with a bquark were given 50% chance to be 
tagged as bjets. 

Leptons and Missing Energy 

Missing energy was calculated by summing vectori- 
ally all reconstructed jets, muons and electrons and then 
reversing the sign. Both jets and Met resolutions were 
checked to be as described in [11]. Muon and electron 
energies were smeared at the percent level which is a typ- 
ical resolution for an LHC detector. 



Event Selection 

Events were selected as top pair candidates by requr- 
ing all objects of the final state to be above an energetic 
threshold in order to be well reconstructed: at least two 
leptons (electrons or muons) with Pt > 20 GeV, and at 
least two bjets with Pt > 20 GeV. In case the event had 
more than two bjets or leptons, the two most energetic 
ones were selected. All jets and leptons were required to 
be in the pseudorapidity region \r{ \ < 2.5. Finally, events 
with transverse missing energy less than 20 GeV were 
rejected. Events satisfying the above selection were used 
as input to the algorithm described in the next section. 

THE METHOD 

The kinematics of tt dilepton events can be expressed by 
two linear and six non linear equations (Appendix). The 
system is solvable with respect to the unknown neutrino 
and antineutrino four vectors, provided that the masses 



of top quark and W boson are known. An analytical solu- 
tion for the equations of the top-pair system is described 
in [12], [13]. The transverse missing energy components, 
the momentum of bquarks and leptons as well as the 
masses of top quark and W boson are used as an input 
for the analytical solution algorithm. Each possible input 
can give 0, 2 or four different solutions for the unkown 
neutrino and antineutrino momentum components. In ad- 
dition, there are two possible combinations of bjets and 
leptons that could originate from the same top quark, giv- 
ing in total up to eight solutions. So, given the momenta 
of leptons and bjets together with METx^^ETy there can 
be up to eight possible neutrino and antineutrino momen- 
tum vectors for a given mt and mw- Knowledge of the 
momenta of the invisible neutrinos allows full kinematic 
reconstruction of the event including the four- vectors of 
W-bosons, top quarks and the tt system. 

The initial goal was to "rediscover" W boson and top 
quark by looking at LHC data without any assumption 
about the underlying theory, except the topology of top- 
pairs. As the masses of the particles are unknown the 
only option is to test every point of the mt, mw plane 
for possible solutions. The mass plane can be scanned 
in steps of 1 GeV to produce the area in which each 
one of the solutions exists or not. Solvability for a single 
event can be defined as the existance (or not) of a specific 
solution in a specific mass point. An example of such 
a solution area for a single top pair event is plotted in 
Figure 1 (left). By observing several signal events, it can 
be seen that solvability is bounded only from below for 
both mt and mw- 

Due to the finite collission energy there is also an up- 
per limit to the allowed masses produced. The center of 
mass energy of the partons partipipating in the hard scat- 
tering has to be smaller than the LHC collision energy. 
For a p-p collider this energy limit can be expressed fully 
by the parton distribution functions (PDFs) of the pro- 
ton in the following way: each solution allows full re- 
construction of event kinematics, including the estima- 
tion of the energy E and Pz momenta component of the 
tt system. These variables can be easily transformed to 
the fraction of beam energy of the two partons partici- 
pating in the hard scattering (xi^2 = E±pz/2). So each 
parton with fraction xi can be assigned with a probability 
F(xi) to originate from a proton-proton collision. By mul- 
tiplying the probabilities of the two partons a weight per 
mass point can be assigned for each solution. As there 
are more than one possible leading order parton-parton 
interactions (uu, uu, dd, dd, gg) weights from all possi- 
ble combinations are summed to estimate a final event 
weight per solution and mass point. The weight can be 
written as J^F(xi)F(xi), where the summation is over the 
parton combinations and F is the CTEQ6.1 PDF set with 
momentum transfer = m^ [9]. For the estimation of 
the PDF values the LHAPDF interface was used [10]. 




FIGURE 4. Reconstructed mt and mw per event for a sample of signal and background events corresponding to an intergrated 
luminosity of 1 fb~^ 



The PDF weight normalized to unit volume provides 
an upper bound for the mass values of both mt and 
mw. An example of solvability weighted by the PDFs 
for the same single top pair event is plotted in Figure 
1 (right). It is interesting to mention that the prefered 
mass point is not the one with the lowest mt and mw 
values as one might have guessed from the fact that 
PDFs favour lower mass values. Use of the solvability 
together with a matrix element weight which depends on 
the model has been proposed for top mass estimation in 
a single mass dimension [14]. This proposal has evolved 
to the matrix element weighting method in Tevatron 
[15]. What is proposed in this study is a general multi- 
dimensional search method in mass space using only 
model independent PDF weights rather than a top mass 
measurement method using matrix elements. 

Detector effects can change the momenta of the lep- 
tons and jets making a solvable event not- solvable. In 
many cases solvability can be recovered by smearing 
the leptons and jets according to detector resolution. For 
each initial event, N test events can be created by smear- 
ing the leptons, jets and missing energy components of 
the recorded event according to the detector resolutions. 
For these test events, solvability can be defined as the 
fraction of them for which a specific solution exist. Solv- 
ability of the 300 test events created by the initial single 
top pair event is plotted in Figure 2 (left). 

For each test event, solvability of a solution (which is 
either zero or one) can be multiplied with the PDF weight 
calculated for the specific solution and mass point. The 
value obtained is averaged over all 300 test events and 
normalized to unit volume. Such a distribution can be 
constructed for all possible solutions. An example is 
given in Figure 2 (right) for the correct solution of the ini- 



tial top pair event. Among all solutions, the one with the 
highest PDF weight is chosen. The final mt and mw es- 
timation for this event is the mass point where the distri- 
bution of the prefered solution is maximized (Figure 3). 
The above procedure gives a single mass point per event. 
The other option is to construct a complicated likelihood 
in order to exploit all the available information. A sin- 
gle mass entry is prefered so as to reconstruct invariant 
mass distributions in which robust discovery techniques 
can be applied. Is is worth emphasizing the power of the 
PDF weights to choose a prefered solution. Not all so- 
lutions are as likely to originate from a proton-proton 
collision and the parton distribution functions can dis- 
tinguish one of them. This might be applicable to other 
cases with combinatoric backgrounds such as reconstruc- 
tion of chains with visible particles. 

By applying the method in signal and background 
events corresponding to an intergrated luminosity of 1 
fb~^, the 2-dimensional mass distribution can be created 
(Figure 4). In order to produce the invariant mass distri- 
bution for the ligther particle, all events around the heav- 
ier one can be selected, according to the mass resolution 
of the detector. For W boson, all events with 150 GeV 
< mt < 190 GeV in the 2-dimensional mass distribution 
can be selected (Figure 5, left) . In a similar way for top 
quark, all events with 70 GeV <mw < 90 GeV are se- 
lected (Figure 5, right). The 2-dimensional mass distri- 
bution as well as the one-dimensional ones show that the 
method works: both top quark and W boson would be 
observed without any a priori knowledge of their masses 
or of the underlying theory, in a model independent way. 
The method described can be tested using LHC data. If 
it works for top pairs, it is likely to work for any of the 
searches described in the next section. 





FIGURE 5. Reconstructed mw when requiring mt to be in the range 150-190 GeV (left) as well as reconstructed mt when 
requiring mw to be in the range 70-90 GeV (right), for a sample of signal and background events corresponding to an intergrated 
luminosity of 1 fb~^. 



APPLICATIONS 
Search for a resonance decaying to top pairs 

An interesting application is the search for a heavy 
particle decaying to top pairs pp ^ X' ^ tt in the dilep- 
ton final state. The later has less background than the 
semileptonic and hadronic top-pair final states. In ad- 
dition, it allows easier boosted top reconstruction from 
its final state objects as top is reconstructed from a sin- 
gle bjet and a lepton rather than 3 closely spaced jets. 
In order to test the method, a sample of 300 events 
of pp ^ Z' ^ tt were created, with mz/=500 GeV and 
rz/=15 GeV. 

The average weighted solvability of the test events is 
used to get an estimated mt and mw per event, as de- 
scribed in the previous section. The solution and mass 
point with the highest PDF weight is chosen. A full kine- 



matic reconstruction can now take place for each of the 
test events: Starting from the neutrino momenta, the four- 
vectors of W boson and top quark can be calculated for 
both branches. Knowledge of the above allows the re- 
construction of the tt system and subsequently its invari- 
ant mass. The average m^^ of all solvable test events is 
used as an estimate of the mass of a possible X' tt de- 
cay (Figure 6). Such a resonance would be observed as 
a peak in the m^^ distribution, provided that its width is 
not very wide. The 2-dimensional reconstructed mt and 
mw per event can be seen in Figure 7 (left). The recon- 
structed m^/ for the same events is shown in Figure 7 
(right). The invariant mass distribution is an interesting 
observable to look for a heavy new particle as there are 
very few Standard Model events in the high mass range. 



Search for a new generation heavy quark 
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FIGURE 6. Reconstructed m^i for a sample of top pair and 
background events corresponding to an intergrated luminosity 
of 1 fb-i. 



The 2-dimensional mass reconstruction descibed is 
model independent and would work for any particle de- 
caying like top quark or W boson. So one could look for 
anything decaying as YY XbXb llbbViVi (Y Xb, 
X Ivi). The two chains can be identical or a parti- 
cle anti-particle pair as in both cases the masses are the 
same. 

In case Y is a next generation heavy quark t' the 
topology would be t'P W+bW~b 1+rbbViVi, 
(t' Wb). Another possibility is to search for X particle 
instead of Y, where X could be a charged higgs boson. 
To demonstrate the reconstruction of a t'P signal, a 
sample of 300 events were used with mt/=250 GeV. The 
2-dimensional mass reconstruction can be seen in Figure 
8 (left). The one-dimensional t' mass reconstruction can 
be seen in Figure 8 (right). 




FIGURE 7. Reconstructed mt and mw per event for a sample of 300 7! tt generated with mz' =500 GeV (left) as well as 
reconstructed m^/ for the same events (right). 
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FIGURE 8. Reconstructed m^/ and mw per event for a sample of 300 t't' W+bW b ^ 1+1 bbViVi events generated with m^/ 
=250 GeV (left) as well as reconstructed m^/ when requiring m^ to be in the range 70-90 GeV (right). 



Top pair identification 

Another possible application of the 2-dimensional 
mass reconstruction is the identification of top pair 
events. For many searches performed in final states with 
missing energy top pairs are the most significant SM 
background. The MET-fike observables used to establish 
a discovery have their tail populated by top pair events. 
So, by identifying them using mass constraints in the 2- 
dimensional mt and mw plane, we can supress signifi- 
cantly the most important source of a possible fake dis- 
covery. 

Top mass measurement 

The 2-dimensional mass reconstruction can also be 
used for an imporoved top mass measurement. The si- 
multaneous reconstruction of mw together with mt gives 
a handle to control the main systematic effect of the mea- 
surement: the jet energy scale. Calibration of this scale 
could be based on mw reducing significantly the effect 
on the reconstruction of mt. 



CONCLUSIONS 

Mass space is the natural space to search for new par- 
ticles. Observation of mass peaks above background al- 
lows robust discovery using data-driven background es- 
timation from the sidebands. But most important, recon- 
struction of the unknown masses gives valuable insights 
to what the new physics is. The search is model indepen- 
dent as the only assumption is the decay topology. As 
a proof of principle, mass peaks of both top quark and 
W boson can be produced from leptonic top pair decays 
from simulated events as well as using LHC data. It is 
shown that the method can be used to reconstruct mass 
peaks from heavy resonances decaying to top pairs as 
well as next generation heavy quarks decaying with the 
same event topology. Possible applications to supersym- 
metric mass reconstruction will be described in a future 
publication. Other applications include top pair identifi- 
cation in order to reject them from the tail of MsT-fike 
observables for supersymmetric searches, as well as use 
of the mw to control the jet energy scale for an improved 
top mass measurement. 
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APPENDIX 

The top pair event topology has the following con- 
straints: 

MeTx = PVx + PVx = Pvx + Pvy + PVz 

METy = PVy + PVy ^ = P^ + p\ ^ P^ 

= (Ei+ +Ev)^ - (Pi+ +pvj^ 
-(Pl++Pvy)^-(Pi++Pvz)^ 

m^_ = (Ei-+Evf-(Pi^-+Pvj' 

-(Pl-+PVyf -(Pl-+PVz)^ 

= (Eb + Ei+ +Ev)^ - (pb, +Pi+ +PvJ^ 

-(Pby +Pl+ +PVy)^ - (Pbz +Pl+ +PVz)^ 

mf = (Eb + Ei- +Ev)' - (ps, +Pi- +pvj' 

-(Pby +Pl- +PVy)^ - (Pbz +Pl- +PVz)^ 
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